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Abstract 

Formal concept analysis (FCA) is a well-founded method for data anal¬ 
ysis and has many applications in data mining. Pattern structures is an 
extension of FCA for dealing with complex data such as sequences or 
graphs. However the computational complexity of computing with pattern 
structures is high and projections of pattern structures were introduced 
for simplifying computation. In this paper we introduce o-projections of 
pattern structures, a generalization of projections which defines a wider 
class of projections preserving the properties of the original approach. 
Moreover, we show that o-projections form a semilattice and we discuss 
the correspondence between o-projections and the representation contexts 
of o-projected pattern structures. 

KEYWORDS: formal concept analysis, pattern structures, representa¬ 
tion contexts, projections 


1 Introduction 

A significant part of recorded data represents phenomena in a structured way, 
e.g., a molecule is better represented as a labeled graph than as a set of at¬ 
tributes. Pattern structures are an extension of FCA for dealing with such kind 
of data HEna. Such a pattern structure is defined by a set of objects, a set of 
descriptions associated with the set of objects, and a similarity operation on de¬ 
scriptions, matching a pair of descriptions to their common part. For instance, 
the set of objects can contain molecule names, the set of descriptions contains 
fragments of molecules, and the similarity operation taking two sets of graphs to 
a set of maximal common subgraphs. The similarity operation is a semilattice 
operation on the set of descriptions. It allows one to deal with data (objects 
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and their descriptions) in a similar way as one deals with objects and their in¬ 
tents in standard FCA. Such kind of formalization allows one to describe many 
types of data, however processing can be computationally very demanding. For 
example, pattern structures on sets of graphs |S1 [21 IS] is based on the operation 
of finding maximal common subgraphs for a set of graphs, which is ^P-hard. 

To deal with this complexity and to have a possibility to process most of 
the data, projections of pattern structures were introduced [3]. Projections are 
special mathematical functions on the set of descriptions that simplify the de¬ 
scriptions of objects. This approach reduces the number of concepts in the pat¬ 
tern lattice corresponding to a pattern structure. However, it does not impact 
the computational worst-case complexity of the similarity operation. Moreover, 
it cannot remove concepts of special kinds from the “middle” of the semilattice 
which can be important in some practical cases, e.g., concepts containing too 
small graphs can be considered useless but they cannot be removed with projec¬ 
tions. For example, in [T] concepts having intents that include short sequences 
of patient hospitalisations have little sense. Hence, short sequences could be “re¬ 
moved” from the intent, but the descriptions of objects, i.e., patients, usually 
include only one long sequence and should not be changed. 

In this paper we introduce o-projections of pattern structures, a generaliza¬ 
tion of projections of pattern structures, that allow one to reduce the computa¬ 
tional complexity of similarity operations. They also allow one to remove certain 
kinds of descriptions in the “middle” of the semilattice while the descriptions 
of the objects can be preserved. By introducing o-projections of pattern struc¬ 
tures, we correct also some formal problems of projections of pattern structures, 
which will be discussed later. 

The main difference between o-projections and projections is that in o- 
projected pattern structures we modify the semilattice of descriptions, while 
in the case of projected pattern structures we can modify only the descriptions 
of single objects. It should be noticed that most of the properties of projec¬ 
tions are valid for o-projections. However, the relation between representation 
contexts, a reduction from pattern structures to FCA, and projections is quite 
different from the relation between representation contexts and o-projections. 
The introduction and study of this difference is one of the main contributions of 
this work. In addition we have discovered the fact that the set of o-projections 
of a pattern structure forms a semilattice. From a practical point of view it al¬ 
lows one to apply a set of independent o-projections, e.g., o-projections obtained 
from several experts, to a pattern structure. 

This work further develops the methodology introduced in [T], where it was 
applied for the analysis of sequential pattern structures by introducing projec¬ 
tions that remove irrelevant concepts. 

The rest of the paper is organized as follows. In Section |2| we introduce the 
definitions of a pattern structure, representation context of a pattern structure, 
and discuss how one can compute with pattern structures along the lines of FCA. 
Section|3|introduces projections and o-projections of a pattern structure, defines 
the partial order on o-projections and shows that this order is a semilattice. At 
the end of this section the relation between o-projections and representation 
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contexts of o-projected pattern structure is discussed. Finally, we conclude the 
paper and discuss furture work. 


2 Pattern Structures 

In FCA a formal context {G,M,I), where G is a set of objects, M is a set of 
attributes, and I C G x M is a binary relation between G and M, is taken 
to a concept lattice £,{G,M,I) [3]. For non-binary data, such as sequences or 
graphs, lattices can be constructed in the same way using pattern structures [5]. 

Definition 1. A pattern structure ¥ is a triple {G,{D,r\),S), where G,D are 
sets, called the set of objects and the set of descriptions, and S : G ^ D maps an 
object to a description. Respectively, (D,n) is a meet-semilattice on D w.r.t. 
n, called similarity operation such that 5{G) := {<5(5) | g G G} generates a 
complete sub semilattice (I? 5 ,n) of{D,n). 

For illustration, let us represent standard FCA in terms of pattern structures. 
The set of objects G is preserved, the semilattice of descriptions is (p(M),n), 
where p{M) denotes the powerset of the set of attributes M, a description is 
a subset of attributes and fl is the set-theoretic intersection. If a: = {a, b, c} 
and y = {a, c, d} then xr\y = xr\y= {a, c}, and d : G —>■ p(M) is given by 
S{g) = {m € M \ {g,m) € I}. 

Note that Definition [T] has an important partial case where {D, □) is a com¬ 
plete meet-semilattice. In this case the semilattice (D, 5 ,n) is necessarily com¬ 
plete. First, in practical applications one often needs finite lattices, which are 
always complete. Second, in many practical cases one can easily extend an in¬ 
complete semilattice to a complete one by introducing some extra elements. For 
example, given an incomplete semilattice w.r.t containment order on the inter¬ 
val (a, b), one can add a and b to obtain the interval [a, b], which is a complete 
semilattice. In this paper some of the statements hold only for the partial case 
of {D, n) being a complete meet-semilattice. 

The Galois connection for a pattern structure (G, {D, □), d), relating sets of 
objects and descriptions, is defined as follows: 

forACG 

geA 

:= {g e G I d E d(g)}, for d e D 

Given a subset of objects A, A* returns the description which is common to 
all objects in A. Given a description d, d* is the set of all objects whose descrip¬ 
tion subsumes d. The natural partial order (or subsumption order between de¬ 
scriptions) E on I? is defined w.r.t. the similarity operation fl: c E d cHd = c 
(in this case we say that c is subsumed by d). In the case of standard FCA the 
natural partial order corresponds to the set-theoretical inclusion order, i.e., for 
two sets of attributes x and y x Gy ^ x G y. 
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[ 1 , 1 ] [ 1 , 1 ] 
[ 2 , 2 ] [ 2 , 2 ] 
[3,3] [2,2] 



(a) An interval context. (b) An interval pattern lattice. 

Figure 1: An interval pattern structure and the corresponding lattice. 


Definition 2. A pattern concept of a pattern structure (G, (Z?,n),(5) is a pair 
{A,d), where A C G and d € D such that A^ = d and d* = A; A is called the 
pattern extent and d is called the pattern intent. 

As in standard FCA, a pattern concept corresponds to the maximal set of 
objects A whose description subsumes the description d, where d is the maximal 
common description of objects in A. The set of all pattern concepts is partially 
ordered w.r.t. inclusion of extents or, dually, w.r.t. subsumption of pattern in¬ 
tents within a concept lattice, these two antiisomorphic orders making a lattice, 
called pattern lattice. 

2.1 Running Example 

The authors of [5] have used interval pattern structures for gene expression 
analysis. Let us consider an example of such pattern structures. In Figure fla] 
an interval context is shown. It has three objects and two attributes. Every 
attribute shows the interval of values the attribute can have. If we have two 
objects, then a numerical attribute can have all values from the interval of this 
attribute in the first object and from the interval of this attribute of the second 
object. Consequently, the similarity between two intervals can be defined as 
a convex hull of the intervals, i.e. [a, 5] □ [c, d] = [min(o, c), max(&, d)]. Then, 
given two tuples of intervals, the similarity between these tuples is computed as 
a component-wise similarity between intervals. 

In this example, we have the pattern structure (G, (I?, n), d), where G = 
{ 91 J 92 , 93 }, the set D is the set of all possible interval pairs with the similarity 
operation described above, and 5 is given by the context in Figure [T^ i.e., 
%i) = ([1,1]; [1,1]) and 5{gi) □ ^( 32 ) = ([1, 2]; [1, 2]). 

Figure [lb] shows the pattern lattice of the interval context in Figured^ One 
can check that the extents and the intents in this lattice are connected by means 
of the Galois connection given above. The partial order in the semilattice of 
intervals is given by “the smaller the interval, the larger the description with 
this interval”, i.e., the former description gives more certainty about the values 
than the latter. 
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2.2 Representation Context of a Pattern Structure 

Note that any pattern structure can be represented by a formal context with 
the concept lattice isomorphic to the lattice of the pattern structure. Below we 
introduce a representation context of a pattern structure and its properties in 
the line of [5]. 

Given a pattern structure (G, (£>, n), J), we denote hy Dg C D the set of 
all intents of the concept lattice, i.e., Dg = {d D \ {3X C G) fl 5{g) = d}. 

gGX 

Since {Dg, n) is a complete subsemilattice of {D, n), tor X C D a join operation 
U can be defined as follows: 

ijx=n{rf G Dg I (Va; G X)x C d}. 

Given this join operation, (Z35,n,U) is a complete lattice. We say that a set 
M C D is U-dense for {Dg, n) if every element in Dg is of the form UX for some 
X C M. For example, M = Dg is always U-dense for Dg. 

Definition 3. Given a pattern structure P = {G,{D,r\),6) and a set M C D 
U-dense in Dg, a formal context {G,M,I) is called the representation context 
o/P, if I is given by I = {{g,m) G G x M | m U S{g)}. The representation 
context o/P is denoted by ]R(P). 

The next theorem establishes a bijection between the pattern concepts in the 
lattice of pattern structure P and the concepts in the lattice of the representation 
context ]R(P). Here, the ideal of element d £ D is denoted hy f d = {e £ D \ 
e U d}. 

Theorem 1 (Theorem 1 from [3]). Let P = (G, {D, □), S) be a pattern structure 
and let R(P) = (G, M, I) be a representation context o/P. Then for any A C G, 
B C M, and d £ D the following conditions are equivalent: 

1. {A, d) is a pattern concept of IP and B =/ d fl M. 

2. {A, B) is a formal concept of ]R(P) and d = \_\B. 

Example 1. A representation context for the pattern structure given in Fig¬ 
ure [7] can be given by the set M where every element m £ M is of the form 
([—oo, a]; [—oo, -|-oo]) or ([—oo, -foo]; [6, -foo]), and a,b £ {1, 2,3}. 

In fact, the element ([—oo, -foo]; [a, -foo]) corresponds to the attribute ’m 2 > 
a ’ in the case of the interordinal scaling n of numerical data. Another represen¬ 
tation context can be constructed from the intents of join-irreducible concepts of 
the lattice in Figure \lb[ These two representation contexts of the pattern struc¬ 
ture related to Figure\^ are shown in Figures and\2b[ It can be seen that the 
resulting lattices, e.g., the lattice in Figure \2(\ are isomorphic to the lattice in 
Figure [T&l 

It should be noticed that in some cases the representation context is hard 
to compute. For example, in case of numerical data with the set of all values 
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(c) A concept lattice for the context if Figure [2bl 


Figure 2: Possible representation contexts for the pattern structure in Figure [T] 
and the concept lattice for the context in Figure [2bl 


W, to construct representation context, one needs to create 2\W\ + 1 binary 
attributes, which can be much more than the number of original real-valued 
attributes. The authors of [6] have shown that pattern structures provide more 
efficient computations than the equivalent approach based on FCA and scaling, 
which can be considered as a way to build representation context of interval 
pattern structures, e.g., see Figure [2al 

In case of graph data the set of attributes of the representation context 
consists of all subgraphs of the original graph descriptions, which is hard to 
compute [5]. 

2.3 Computation of Pattern Lattices 

Nearly any algorithm for computing concept lattices from contexts can be 
adapted to compute pattern lattices from pattern structures. To adapt an al¬ 
gorithm, every set intersection operation on attributes is replaced by the semi¬ 
lattice operation FI on corresponding patterns, and every subset checking is 
replaced by the semilattice order C checking, in particular, all (•)' operations 
are replaced by (■)*. For example, let us consider a modified version of Close- 
by-One (CbO) algorithm [7]. 

Algorithm [T] shows the listing of the modified part of CbO. Here the canonical 
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Function CloseByOne Int) 

Data: P = (G, (D, n), J), the extent Ext and the intent Int of a 
concept. 

Result: All canonical ancestors of (Ext, Int) in the concept lattice, 
foreach S C G, S Ext do 

Newint < — n S{g) ; /* n - the similarity */ 

g&S 

NewExt i — {5 G G I Newint C < 5 ( 5 )} ; /* IZ - the 

subsumption */ 

if IsCanonicExtension(_Ea;i, NewExt) then 
SaveConcept {[NewExt, Newint)); 

CloseByOne (NewExt,Newint); 

/* Looking for all concepts of the concept lattice */ 

CloseByOne(0,T); 


Algorithm 1: The version of the Close-by-One algorithm computing the 
pattern lattice of a pattern structure P. 


extension IsCanonicExtension and canonical order are defined on the set of 
objects and hence are the same as in [7]. We can see that only lines 3 and 4 are 
modified. In these lines the set intersection operation and the subset relation 
checking are replaced by the corresponding operators of a pattern structure. 

3 Revised Projections of Pattern Structures 

Pattern structures are hard to process due to the large number of pattern con¬ 
cepts in the pattern lattice and the algorithmic complexity of the similarity 
operation □. Projections of pattern structures “simplify” to some degree the 
computation and allow one to work with “simpler” descriptions. In fact, a pro¬ 
jection can be considered as a mapping for pruning descriptions with certain 
mathematical properties. These properties ensure that a projection of a semi¬ 
lattice is a semilattice and that the concepts of a projectecB pattern structure 
are related to the concepts of the original pattern structure [3] . 

In this section we introduce o-projected pattern structures (“o” coming from 
“order”), i.e. a revision of projected pattern structures in accordance with [T]. 
We discuss the properties of o-projected pattern structures and relate them to 
the projected pattern structures from [3]. The notion of (o-)projected pattern 
structure is based on a kernel operator (a projection). 

Definition 4 ([3]). A projection D ^ D is a kernel (interior) operator 
on the partial order [D,Q), i.e. it is (1) monotone (x Q y ^ V'(^) E i’iu)), 
(2) contractive ('ijj{x) E x) and (3) idempotent (il}[%([x)) = 'ip{x)). 

^ We use the expression “a projected pattern structure” instead of “a projection of a 
pattern structure” to distinguish between projection as an operator and as the result of 
applying the operator to a pattern structure. 
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D = {x,2/,0,±} 



ip -.X ^ x,y y, 

2 ±, ± _L 

ipix^y) = ip(z) = -L ^ 

z = ipix) r\ip{y) 


Figure 3: Contrexample to Proposition 1 from [3]. 


Given a projection ip we say that the fixed point of ip is the set of all elements 
from D such that they are mapped to themselves by ip. The fixed point of ip 
is denoted by ip{D) = {d & D \ ip{d) = d}. Note that, if ip{d) ^ d, then there 
is no other d such that ip{d) = d because of idempotency. Hence, any element 
outside the fixed point of the projection ip is pruned. 

3.1 Definition of Projected Pattern Structures 

Let us first consider the projected pattern structure w.r.t. a projection ip accord¬ 
ing to [3]. Given a pattern structure P = (G, {D, n), d) and a projection ip on D, 
the projected pattern structure is defined as (G, (d?, □),■)/) o d). As we can see, a 
projection only changes the descriptions of the objects but not the underlying 
semilattice {D, n). There are two problems with this definition of the projected 
pattern structures. First, it is necessary to restrict the class of projections given 
by Definition |4] in order to ensure the property ip{xr\y) = ip{x) r\ip{y). Second, 
the complexity of computing □ can be very high, but with this kind of pro¬ 
jected patten structures we cannot decrease the algorithmic complexity. Below 
we discuss these two points. 

In |3] (Proposition 1) the following property of the projection operator is 
discussed: given a semilattice {D, n) and a projection ip on D, for any two 
elements x and y from D one has ip{x n y) = ip{x) □ ipiy). Let us consider 
the example in Figure |3] with the meet-semilattice D = {x, y, z, T} given by its 
diagram and the projection ip given by the dotted lines. It is easy to see that 
ip{x r\y) = ± z = ip{x) n ip{y). One way of solving this problem is to give 
additional conditions on projection ip that would imply the required property. 
An important example is the following condition: for all x, ?/ S D if x < y and 
ip{ij) = y, then ip{x) = x. This kind of solution respects the intuition behind 
the definition of the projected pattern structure in [3], according to which the 
initial descriptions of objects are changed, but the similarity operation n is not 
changed. 

Another way of solving the problem above is to generalize the definition of 
the projected pattern structure, and we proceed in this way in the next section. 
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by allowing to modify the similarity operation on descriptions. 


3.2 Definition of o-projected Pattern Structures 

Below we propose a definition of o-projected pattern structures by means of a 
kernel operator ip. The definition takes into account the problems discussed 
above. In the o-projected pattern structure we substitute the semilattice of 
descriptions by its suborder (the letter “o” comes from “order”) with another 
similarity operation, which can be different from the initial one. 

Let us first note that, given a meet-semilattice D and a kernel operator ip, 
the fixed point ip{D) is a semilattice w.r.t. to the natural order on D. 


Theorem 2. Given a semilattice {D, n) and a kernel operator ip, the fixed point 
{ip{D),r\fi) is a semilattice w.r.t. the natural order on (£),□), i.e., di G d 2 
di n (i 2 = di- exists for a set X C D, then \~\.,p'ipix) exists and is given 

x^X 


by 


= ipi\~\x) (1) 

xGX ^ 6 ^ 


Proof. Let us denote d= \~\ x. Since (Va; S X)d C x, one has (Va: € X)ip{d) C 

xex 

ip{x). Let us show that for any p G ip{D), i.e. ip{p) = p such that (Vx G X)p C 
ip{x), we havep C V^(d), i.e., that ip{d) = \~\.,pip{x). 

xGX 

Since (Vx G X)p C ip{x) then (Vx G X)p C x. Since d = fl has 

xex 

p Q d. Thus, p = ipip) E "fpid) and ip{d) is the minimum of the set ip{X), i.e. 
ip{D) is a semilattice and the Eq. ([T|) holds. □ 

Corollary 1. Given a complete subsemilattice D of[D,r\) and a kernel operator 
Ip on D, the image of D is a complete sub semilattice ip{D) of the fixed point 
{iPiD),n^). 

Since according to Theorem[5]^/>(id) is a semilattice and according to Corol¬ 
lary [I] '0(15,5) is a complete semilattice, we can define an o-projected pattern 
structure as a pattern structure with ip{D) as a semilattice. 


Definition 5. Given a pattern structure P = (G, (D, n),d) and a kernel op¬ 
erator Ip on D, the o-projected pattern structure ip(F) is a pattern structure 
(G, {ip{D), n^). Ip o S), where ip{D) = {d G i5 | ip{d) = d} and Vx, j/ G D, x fly, 
y := ip{x n y). 


In the o-projected pattern structure the kernel operator ip modifies not only 
the descriptions of the objects, but also the semilattice operation, i.e., the semi¬ 
lattice (0(Z5),n^) is not necessarily a subsemilattice of (d5,n) and so it is not 
always true that x □ y = x fl^ y in D. 


Example 2. Let us define an o-projection for the interval pattern structure 
from Subsection \2.1[ Let us suppose that the aggregated size of a pattern, i.e., 
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the sum of the lengths of the intervals in the pattern, should be less than 2. First, 
we should define the corresponding kernel operator if : D ^ D. Thus, if an 
aggregated length of a pattern p is less than 2, then if{p) := p, otherwise if(j>) := 
_L = ([- 00 ,+cxd]; [— 00 ,+cxd]). For instance, 1]; [1,1])) = ([1j 1]; [1; 1])) 

while '0(([1,2]; [1,2])) = ([—oo,+oo]; [—oo,+oo]), because it has two intervals of 
length 1, i.e., the aggregated size is equal to 2. 

Let us consider the o-projected interval pattern structure (G, {il)[D),n,f,),if o 
5). It is clear that if o 5 = 5, thus this o-projected interval pattern structure 
cannot be expressed as a projected pattern structure. 

The concepts of a pattern structure and a projected pattern structure are 
connected through Proposition [TJ This proposition can be found in [5] , but 
thanks to Theorem [21 it is also valid in our case. 

Proposition 1. Given a pattern structure P = (G, (H, n),5) and a kernel op¬ 
erator if on D: 

1. if A is an extent in ^/’(P), then A is also an extent in P. 

2. if d is an intent in P, then if id) is also an intent in '0(P). 

It is easy to see that the other propositions from [3] concerning projected 
pattern structures hold for the o-projected pattern structures as well. Below 
we cite Proposition 3 from [3] that relates implications in a pattern structure 
and those in an o-projected pattern structure. We skip the propositions related 
to supervised classification with projected pattern structures by means of hy¬ 
potheses, because it is out of the scope of this paper. However, they are valid 
in the case of o-projected pattern structures and can be proven with the help of 
Theorem |2l 

Proposition 2 (Proposition 3 from [3]). Let a,b € D. If if (a) —>■ if{b) and 
if{b) = b then a ^ b, where x ^ y for all g € G {x Q S{g) implies y C <^( 5 )) 

3.3 Order of Projections 

In this subsection we limit ourselves to the practically important case when a 
set of descriptions is a complete semilattice. We can consider projections as 
a means of description pruning in (T), n). Indeed, given a semilattice (T), n) 
and a projection if on this semilattice, the set D can be divided into two sets 
D = {d G D \ if{d) = d} U {d G D \ if{d) ^ d}, i.e., the fixed point of if and 
the rest. It can be seen that the intents of the o-projected pattern structure 
if{{G, {D, n), i5)) are in the fixed point of if, i.e., all elements of the form if{d) ^ d 
are discarded. We recall that by if{D) = {d G D \ if{d) = d)} we denote the 
fixed point of if. But under which condition do we have that for any Di C D 2 
there is a projection if of D 2 such that if{D 2 ) = Di ? The following theorem 
gives necessary and sufficient conditions for such a property. 

Theorem 3. Given a complete semilattice (D,A), with the natural order <, 
and Dg C D, there is a projection if : D ^ D such that if{D) = Dg, if and only 
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z/ _L G Dg and for any X C Dg Q D, one has \J X £ Dg, where J- := /\D and 
\/X = /\{deD\{yx€ X)d>x}. 

Proof. 1. Given a projection such that 'ip{D) = Dg, ± £ Dg because of 
contractivity of if, i.e., = -L. Let us suppose that there is a set 

X C Dg, i.e., (Vx £ X)il){x) = x such that if(f\/ X) ^ \/ X. Then, 
(Vx € X){x < \/ X => X < ijj{\/ X) < V-^)- It is a 

monotonicity contractivity 

contradiction, since V ^ is the supremum of X. Hence for any X C Dg 
we have ipf^ X) = \J X. 

2. Given Dg C D such that -L £ Dg and for any X £ Dg, one has \J X £ Dg, 
let us construct the corresponding projection if. First, if{d £ Dg) := d 
and for all d £ D\Dg we should have if{d) d. For an element d £ D\Ds, 
let us consider the set Sd = {x £ Dg \ x < d}, which is not an empty set 
since J- £ Dg. We know that \/ Sd £ Dg and by dehnition of V we have 
\J Sd < d. Then we set if{d) := \J Sd. 

Let us show that the function z/i is a projection of D. Idempotency 
and contractivity are satisfied by the construction of if. Let us check 
monotonicity. Let us take any a,b £ D such that a > b. Then, if 
if{a) = a, then if{a) = a > b > if{b), i.e., the monotonicity holds. 
If if {a) ^ a, then if{a) = \J Sa hy construction. Hence, if if{b) = b, 
then b £ Sa, i.e., if{a) > if{b). Finally, if if{b) 7 ^ b, then Sb C Sa, be¬ 
cause if d S Sb, i.e., d < b, then d < b < a, i.e. d £ Sa. In this case, 
if{a) =\/ Sa >\/ Sb = if{b). 

□ 

Corollary 2. Given a complete semilattice {D,A), with the natural order <, 
and a subset Dg C D such that T £ Dg and for any X C Dg, one has \J X £ Dg, 
the poset {Dg, <) is a complete semilattice. 

Proof. According to Theorem [3] there is a projection if : D ^ D such that 
if{D) = Dg. Then, according to Theorem [2] is a semilattice. □ 

Since a projection of D can be considered as a mapping with the fixed point 
if{D), we can introduce an order w.r.t. this fixed point. 

Definition 6 . Given a complete semilattice {D, n) and two projections ifi and 
if 2 in D, we say that ifi < if 2 if ifi{D) C if 2 (D). 

However in some cases, it is more convenient to order projections w.r.t. a 
superposition of projections or their “generality”. 

Definition 7. Given a complete semilattice {D, n) and two projections ifi and 
if 2 in D, we say that ifi < if 2 if there is a projection if : if 2 {D) —>■ if 2 {D) such 
that if I = if o if 2 . 

It can be seen that these two definitions yield the same ordering. 
Propositions. Definitions\d\ an dQ are equivalent. 
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Proof. 1. Let ipi = oij} 2 - Since if \s a, projection in ' 02 (-D), then ifi{D) = 
ip{if2{D)) C ii)2{D). 

2. Let ifi{D) C ip 2 {D). Let us denote by (-ji and (•)2 the operations in 
{ifi{D), n^j) and (' 02 (L*), ni/, 2 ), respectively, and let us denote Di = ifi{D) 
the hxed points of ifi, where i G { 1 , 2 }. 

Let us build if : D 2 ^ Di equal to ifi in D 2 , i.e., for all d £ D 2 we 
set if{d) := ifi{d). Since ifi is a projection in D, if is a projection in 
D 2 (the natural order is the same). Since Di is the fixed point of ifi 
then '01 (D 2 ) C Di. However, since Di C D 2 and ifi{Di) — Di then 
' 0 i(-D 2 ) = Di, i.e., there is a projection if such that ifi = if o if 2 - 

□ 

Example 3. Let us return to Example We change the threshold for the 
aggregated size. In Example\^it was set to 2 (ifai= 2 ), but we can change it to 5 
(ifai=b) or 10 (ifai=io). The higher the threshold, the more possible descriptions 
are projected to themselves, i.e., belong to the fixed point of the projection. Thus, 
we have lfal =2 < < ' 0 a/=lO- 

Thanks to Proposition [T] it can be seen that, given a pattern structure P, if 
we have two projections ifi < if 2 , then the set of pattern extents of '0i(P) is a 
subset of the set of pattern extents of ■ 02 (P), i.e., the smaller the projection, the 
smaller the number of concepts in the corresponding projected pattern structure. 

Now it can be seen that projections actually form a semilattice with respect 
to the previously defined order. 

Proposition 4. Projections of a complete semilattice {D, n) ordered by Defi- 
nition\^ or^form a semilattice (F, A), where the semilattice operation between 
01 , 02 G F is given by if i A if 2 = if 3, iff ifziD) = if i{D) if 2 {D) . 

Proof. It follows from the definitions that if for any 0i and 02 the projection 
03 exists, then projections of D form a semilattice. Let us describe the corre¬ 
sponding 03 . 

Let us denote Di = ifi{D) and D 2 = 02 (^) and £>3 = Di fl D 2 . Let us 
suppose that there exist x,y £ such that xUy ^ £> 3 . But as £>3 C Di 
and £>3 C D 2 , then, since 0i is a projection of D and if 2 is a projection of 
D, we have x Li y £ Di and x Li y £ D 2 , i.e., x Li y £ Di Ci D 2 = D 3 . Thus, 
(Vx, y £ £> 3 )x Uy £ D 3 . Then, according to Theorem [3] there is a projection 03 
such that if 3 (D) — D 3 . □ 

3.4 Analogue of Theorem II for Revised Projections 

An important question is how a projection changes the representation context of 
a pattern structure? We limit the discussion of this question for the case when 
a set of description D is a complete semilattice. In [3] the authors describe this 
change by means of Theorem 2. The formulation of this theorem was corrected 
in . Below we give the corrected version of the theorem. 
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Theorem 4 (Theorem 2 from [3]). For two pattern structures {G,{D,r\),Si) 
and {G,{D,r\),d 2 ) the following statements are equivalent: 

1. 62 = 'ip o Si for some ip on {D, n). 

2. {\/g G G)(^ 2 (< 7 ) E Si{g)) and there is a representation context {G,M,I) 
of (G, (I?, n), 5i) and some N G M such that {G,N,I n (G x N)) is a 
representation context of (G, (D, n), < 52 )- 

In Theorem U] one compares two pattern structures that differ in mapping 
functions. However, in the o-projected pattern structures we can modify the 
lattice structure itself. How can we adjust the formulation of Theorem in 
such a way that it can he applied to revised projections? First, we should notice 
that in a pattern structure and in an o-projected pattern structure the set of 
objects is preserved. Second, the minimal representation context of a pattern 
structure can have less attributes than the minimal representation context of 
an o-projected pattern structure, as shown in Example ID 

Example 4. Let M = {a, &, c} and the description semilattice be D = (2^^,n). 
Let Ip : 2^ —> 2^ be the following mapping: 'ip({a}) = 0 and for any A {a} 
we put 'ip{A) = A. This projection is visualised in Figure \4a\ by dashed ar¬ 
rows. Let us consider the following pattern structure ({ffi, 32 , 53 }, (2*^, fl), {gi 1 —>■ 
{a, &}, g 2 ^ {a, c}, 33 {&, c}}. 

The minimal representation context of this pattern structure contains 3 at¬ 
tributes M = {a, b, c}, while the minimal representation context of the o-projected 
pattern structure contains 4 attributes My, = {&, c, a&, ac}. The corresponding 
contexts are shown in Figures \4^ and\4c\ 

We can see that to introduce the “revised Theorem 2” from [3] we have to 
define a special relation between contexts. 

Definition 8. Given two contexts Ki = (G,Mi, Ji) and K 2 = {G, M 2 , 12 ), Ki 
is said to be simpler than K 2 , denoted by Ki <5 1 K 2 , if for any miy G Mi there 
is a set B 2 Q M 2 such that ({m-iy})^ = {^ 2 )^- Here by and we denote 
the derivation operators in the contexts Ki and 1 K 2 , respectively. 

Example 5. The context in Figure \4^ is smaller w.r.t. Definition\^than the 
context in Figure [7^ because every column of the context in Figure is the 
intersection of a subset of columns of the context in Figure \4b\ 

This relation between contexts is a preorder. Indeed, it is reflexive, tran¬ 
sitive, but not necessarily antisymmetric: given two contexts Ki and 1 K 2 , if 
Ki and K 2 have the same closure system of attributes, i.e., the same set of 
intents in the concept lattice, then according to the definition Ki <s K 2 and 
Ki IK 2 . However, we can consider only the context with the minimal number 
of attributes in the class of equivalence, i.e., the attribute-reduced context. For 
simplicity in the rest of the paper we consider only attribute-reduced contexts. 

This definition of the simplicity order on contexts can be related to context 
bonds [4] in the following way. Three formal contexts = {Gi, Mi, li) form a 
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Figure 4: An example of a projection that can increase the number of attributes 
in the minimal represenation context. 


bond if Ki <5 K 2 and <5 Kl’, where = (M, Simplicity order 

can also be considered as a generalization of “closed-relation-of’ order between 
contexts: 

Definition 9 (Definition 50 from [4]). A binary relation J C I is called a closed 
relation of the context (G, M, I) if every concept of the context (G, M, J) is also 
a concept of (G, M, I). 

From Definitions [5] and [S] it can be seen that if J is a closed relation of 
{G,M,I), then {G,M,J) <5 {G,M,I), but not always in the other direction. 
The following theorem gives a relation between kernel operators of D and the 
change in the representation context of o-projected pattern structures. 

Theorem 5. Given a pattern structure P = (G, {D, n), J) such that {D, fl) is a 
complete semilattice the following holds: 

1. for any projection ip of D we have R(^(P)) <5 R(P). 

2. for any context IK = (G, M, I) such that IK <s K.(P), there is a projection 
Ip of D such that IK is a representation context ofip(F). 

Proof. 1. The first statement follows from the fact that any extent of '0(P) 
is an extent of P (Proposition [T]). 

2. Given a pattern structure P and a context IK such that K <s M(P), let 
us define the set Dm = {d G D \ {3m € M){m'y = d} (notice that 
for K and P there is the same set G, thus, given A C G, both A' and 
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are defined in K and P correspondingly). Since K <s ]R(P), m' is 
an extent of P. Thus, we can see that there is a bijection between Dm 
and M given by m' = <D. We denote this bijection by f{m) = d, i.e. 
/(to) = d ^ m’ = (F. Correspondingly, given a subset N C M, we 
denote by /(TV) = {d G Dm \ /“^(d) G TV}, i.e., f{M) = Dm- 

Let us dehne = {d G D | {3X C Dm)\_\X = d}. According to 
Theorem [3] there is a projection ip such that Dip = ip{D). 

Let us consider the o-projected pattern structure ip{F). The set Dm is 
U-dense for ip{D), i.e., the context {G,Dm,Idm)^ where {g,d) G Idm 
I p o S{g) □ d, is a representation context of ip{¥). There is the bijection 
between Dm and M. Let us show that the relation / is similar to the 
relation Im, i.e., {g,m) G / {g,f{m)) G Idm- 

It can be seen that for all g G G and all d G f{g'), we get ip o 6{g) □ d, 
because for any d G f{g') we have g G d*. Moreover, for any d € D\ f{g') 
we have d^ipo 5(g). Thus, the context K and the context (G, Dm, Idm) 
are similar, and hence for any context K <s K(P) there is a projection 
such that K is a representation context of ip(¥). 

□ 


4 Conclusion 

In this paper we have introduced o-projections of pattern structures that are 
based on kernel operators ip : D ^ D. O-projections are a generalization of 
projections of pattern structures and allow one to change the semilattice of de¬ 
scriptions in o-projected pattern structures. Thus, the complexity of similarity 
(semilattice) operation can be reduced. Moreover, O-projections also correct a 
formal problem of projections. 

We have shown that o-projections form a semilattice. This can be important 
when several independent o-projections are applied to a pattern structure. For 
example, if projections are discussed with several experts it may happen that 
several types of projections should be combined. In the case of several inde¬ 
pendent projections we know that there is the only one o-projection w.r.t. the 
semilattice of o-projections that is a combination of these projections. 

Finally, we have shown that the representation context of an o-projected 
pattern structure can have more attributes than the representation context of 
the pattern structure itself. To describe this change in the representation context 
after o-projection we have introduced a new order on contexts, with the use of 
which we have described the way the representation context can change. 

An important direction of the future work is to formalize transformations 
of pattern structures, i.e., special homomorphisms between the semilattice of 
descriptions D and a different semilattice Di. In particular, it allows one to 
formalize the mappings of the form ip : D ^ an instance of which are kernel 
functions used in Support Vector Machines (SVM). 


15 


Acknowledgments: this research was supported by the Basic Research Pro¬ 
gram at the National Research University Higher School of Economics (Moscow, 

Russia) and by the Biointelligence project (France). The second author was also 

supported by a grant from Russian Foundation for Basic Research, grant no. 

13-0700504. 

References 

[1] Aleksey Buzmakov, Elias Egho, Nicolas Jay, Sergei O. Kuznetsov, Amedeo 
Napoli, and Chedy Ra'issi. On Projections of Sequential Pattern Structures 
(with an application on care trajectories). In Proc. 10th Int. Conf. Concept 
Lattices Their Appl., pages 199-208, 2013. 

[2] Bernhard Ganter, Peter A. Grigoriev, Sergei O. Kuznetsov, and Mikhail V. 
Samokhin. Concept-Based Data Mining with Scaled Labeled Graphs. In 
KarlErich Wolff, HeatherD. Pfeiffer, and HarryS. Delugach, editors. Con¬ 
cept. Struct. Work SE - 6, volume 3127 of Lecture Notes in Computer Sci¬ 
ence, pages 94-108. Springer Berlin Heidelberg, 2004. 

[3] Bernhard Ganter and Sergei O. Kuznetsov. Pattern Structures and Their 
Projections. In Harry S. Delugach and Gerd Stumme, editors. Concept. 
Struct. Broadening Base, volume 2120 of Lecture Notes in Computer Science, 
pages 129-142. Springer Berlin Heidelberg, 2001. 

[4] Bernhard Ganter and Rudolf Wille. Formal Concept Analysis: Mathematical 
Foundations. Springer, 1st edition, 1999. 

[5] Tim B. Kaiser and Stefan E. Schmidt. Some Remarks on the Relation 
between Annotated Ordered Sets and Pattern Structures. In Sergei O. 
Kuznetsov, DebaP. Mandal, MalayK. Kundu, and SankarK. Pal, editors. 
Pattern Recognit. Mach. Intell. SE - 9, volume 6744 of Lecture Notes in 
Computer Science, pages 43-48. Springer Berlin Heidelberg, 2011. 

[6] Mehdi Kaytoue, Sergei O. Kuznetsov, Amedeo Napoli, and Sebastien Dup- 
lessis. Mining gene expression data with pattern structures in formal concept 
analysis. Inf. Sci. (Ny)., 181(10):1989 - 2001, 2011. 

[7] Sergei O. Kuznetsov. A fast algorithm for computing all intersections of ob¬ 
jects from an arbitrary semilattice. Nauchno-Tekhnicheskaya Informatsiya 
Seriya 2 (Autom. Doc. Mathem. Ling.), No. 1:17-20, 1993. 

[8] Sergei O. Kuznetsov and Mikhail V. Samokhin. Learning Closed Sets of 
Labeled Graphs for Chemical Applications. In Stefan Kramer and Bernhard 
Pfahringer, editors. Inductive Log. Program. SE - 12, volume 3625 of Lec¬ 
ture Notes in Computer Science, pages 190-208. Springer Berlin Heidelberg, 
lecture no edition, 2005. 


16 



